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Abstract—The main proteinase of SARS-associated coronavirus (SARS-CoV) plays an important role in viral transcription and 
replication, and is an attractive target for anti-SARS drug development. The important thing is to understand its binding mech- 
anism with possible ligands. In this study, we investigated possible noncanonical interactions, potential inhibitors, and binding 
pockets in the main proteinase of SARS-CoV based on its recently determined crystal structure. These findings provide a wide clue 
to searching for anti-SARS drug. Interestingly, we found that similar structure patterns exist in SARS-CoV main proteinase with 
Poliovirus 3c Proteinase, Rhinovirus 3c Protease, Nsp4 Proteinase From Equine Arteritis Virus, Hepatitis C Virus Ns3 Protease, 
Hepatitis A Virus 3c Protease, and Dengue Virus Ns3 Protease. It suggests that the available drugs in these viruses could be used to 


fight SARS disease. 
© 2004 Elsevier Ltd. All rights reserved. 


1. Introduction 


Since November in 2002, a highly contagious pneumo- 
nia, severe acute respiratory syndrome (SARS), has 
spread rapidly in Asia, North America, and Europe. A 
new SARS coronavirus (SARS-CoV) has been identified 
as the etiological agent of the disease. Its seriousness lies 
in rapid transmission and high fatality (around 15%). 
However, the origin of SARS-CoV is still unknown, and 
no effective drug or vaccine is available up to now. 


The SARS-CoV replicase encodes two overlapping 
polyproteins, ppla and pplb, which mediate viral rep- 
lication and transcription. While a special main pro- 
teinase, 3C-like proteinase, is responsible for the 
cleavage of polyproteins, its functional importance 
make it an attractive target for drug development. 
Fortunately, the crystal structures of SARS-CoV main 
proteinase has been determined,'* which adopt a serine- 
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protease fold and is homologous to the main proteinases 
from human coronavirus and transmissible gastroen- 
teritis virus.? The goal of this study is to find potential 
inhibitors and locate the ligand-binding sites in SARS- 
CoV main proteinase based on comparison of nonho- 
mologous tertiary structures, hence to provide clues to 
rational drug design. 


2. Materials and methods 


The atomic coordinates of SARS-CoV main proteinase 
were downloaded from Protein Data Bank (ID 1Q2W). 
NCI program’ was used to identify noncanonical 
interactions. VAST  (http://www.ncbi.nlm.nih.gov/ 
Structure/VAST/vastsearch.html) and DALI (http:// 
www.ebi.ac. uk/dali/) programs were used to find similar 
structure patterns and in the main proteinase structure 
of SARS-CoV. The structure alignment was done by 
CE> and the structural comparison was performed by 
LGA.® The constituents of the binding pocket are 
determined by those residues that have at least one 
heavy atom (other than hydrogen) with a distance less 
than 5 A from a heavy atom of inhibitor, as did in Chou 
et al.’ The visualization of 3D structure was generated 
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by PROTEINEXPLORER 
plorer.org). 


(http://www.proteinex- 


3. Results and discussion 


The noncanonical interactions in SARS-CoV main 
proteinase structures are shown in Figure 1. There are 
two pairs of main chain-side chain interactions: Glu288 
(donor) and Trp 207 (acceptor), Ile152 (donor) and 
Phe8 (acceptor). There are four pairs of side chain—side 
chain interactions: Arg40 (donor) and Tyr54 (acceptor), 
Arg298 (donor) and Phe8 (acceptor), Prol84 (donor) 


Figure 1. Noncanonical interactions in the structure of SARS-CoV 
main proteinase. (A) The residue pairs involved are: Arg298 and Phe 8, 
Glu288 and Trp207, and Ile152 and Phe8, which are colored blue. (B) 
The residue pairs involved include: Arg40 and Tyr54, Prol84 and 
Phel85, and Tyr126 and Phe140, which are colored blue. The Cys-His 
catalytic dyad (Cys145 and His41) are colored green. 


and Phel85 (acceptor), and Tyrl26 (donor) and 
Phel40 (acceptor). These residues are marked as ‘@’ in 
Figure 2. 


Among these interactions, Phe8 accepts two N-H---1 
bonds in a sandwich fashion, one donated by a side-side 
chain Arg298, and one donated by a main-side chian 
Tle152, as existed in human racl.® Taken together with 
another N-H---2 interaction between Glu288 and 
Trp207, these noncanonical bindings connected N-ter- 
minus and C-terminus of the enzyme together, then 
fixed to Ile152 (domain I) and Trp207 (domain IIJ), this 
makes the domain IJ and HI not flexible due to the loop 
formed by above interactions in the right and a loop 
already existed in the left (Fig. 1A), that is stabilizes the 
structure of the protease. Furthermore, we turn to 
examine the remaining three pairs of interactions: Arg40 
and Tyr54, Prol84 and Phe185, and Tyr126 and Phel 40. 
It can be seen from Figure 1B that these noncanonical 
bindings will be able to stabilize the protein structure 
around the active center formed by Cys-His catalytic 
dyad (Cys145 and His41). On the other hand, after 
binding, the small helix Arg40 and His41 locate will 
expect to move up, the loop Phel40 and Cys145 locate 
and the loop Prol84 and Phel85 locate will expect to 
move down, hence make the active center more open for 
substrate binding (Fig. 1B). These results can be used for 
rational design of mutagenesis experiments and analysis 
of conservation of interactions at functional sites. In 
recent years, the noncanonical interactions have been 
shown to be important for the stability of protein 
structure?! and ligand recognition.!” 


The structurally similar nonhomologous virus proteins 
identified by VAST and DALI methods focus on 6 
proteases: 1CQQ (Rhinovirus 3c Protease With Ag7088 
Inhibitor), 1LIN (Poliovirus 3c Proteinase), 1MBM 
(Nsp4 Proteinase From Equine Arteritis Virus), 1DY8 
(Hepatitis C Virus Ns3 Protease with inhibitor FKI), 
1QA7 (Hepatitis A Virus 3c Protease with inhibitor 
NFA) and 1DF9 (Dengue Virus Ns3-Protease Com- 
plexed With Mung-Bean Bowman-Birk Inhibitor). The 
structure-based sequence alignment of the above six 
proteases with SARS-CoV main protease is shown in 
Figure 2. The results demonstrate that a number of 
similar beta sheets (marked by b) exist. In particular, 
there are three similar structural patterns showing con- 
servative residues (bold representation). The catalytical 
important residues are included in the first two highly 
conservative patterns: His-Cys or His-Ser catalytic 
dyads.”'"'8 In the third pattern, Gly is conservative in 6 
proteases except SARS-CoV main proteinase, the sig- 
nificance of mutation from Gly161 to Tyr161 in SARS is 
unclear. Anyway, the three structural patterns form the 
common active center of these seven enzymes, as seen in 
their superpositions (Fig. 3). 


Since these nonhomologous virus proteases share the 
same active center, we can consider the possibility of 
applying the inhibitors from these enzymes to SARS- 
CoV main proteinase. In fact, Anand et al.* has indi- 
cated that the inhibitor AG7088 may be modified 
to make it useful for SARS therapy. Chou et al.’ 
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IDY8:A 70 TL------- AGPKGPITOMYTNV------ DODLVGWPA- ---- PPGARSMT - - PCTCG- --SSD--LYLV 
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1DY8:A 140 TR----- HADVIPVRRRG- -------------------- DSRGSL------ LSPRPVSYLKGSSGGPLLC 
10A7:A 140 TTVNG-TPMLISEGPLKMEEKATYVHKKNDGTTVDLTVDQAWRGK- - --------- GEGLPGMCGGALVS 
1DF9:A 140 AL------ EPGKN- PRAVQ-------------------- TKPGLFKTNAGTIGAVSLDFSPGTSGSPIID 
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Figure 2. Structure alignment between SARS-CoV main proteinase (1Q2W) and other proteases: 1L1N (Poliovirus 3c Proteinase), 1CQQ (Rhi- 


novirus 3c Protease), 1 MBM (Nsp4 Proteinase From Equine Arteritis Viru 


s), IDY8 (Hepatitis C Virus Ns3 Protease), 1QA7 (Hepatitis A Virus 3c 


Protease), and 1DF9 (Dengue Virus Ns3-Protease). The residues marked as ‘b’ indicate similar beta-sheets. The bold residues indicate structurally 
similar patterns. The residues marked as ‘@’ make noncanonical interactions. The residues marked as ‘#’ make contact with inhibitors. 


Figure 3. The superposition of SARS-CoV main proteinase (1Q2W, 
white) with other proteases: 1L1N (Poliovirus 3c Proteinase, red), 
1CQQ (Rhinovirus 3c Protease, orange), 1MBM (Nsp4 Proteinase 
From Equine Arteritis Virus, yellow), 1DY8 (Hepatitis C Virus Ns3 
Protease, blue), 1QA7 (Hepatitis A Virus 3c Protease, gray), and 1DF9 
(Dengue Virus Ns3-Protease, green). 


conducted the docking studies of KZ7088 (a derivative 
of AG7088) to the SARS-CoV main proteinase (based 
on a theoretical model). Here, we studied the possibility 
of other inhibitors in anti-SARS drug development. 
Figure 4 shows the structures of docking other inhibi- 
tors from HAV, HCV and Dengue virus to SARS-CoV 
main proteinase, the residues that make contact with 
these inhibitors are marked as ‘#’ in Figure 2. Further- 
more, the corresponding binding pockets are shown in 
Table 1. The results reveal that these inhibitors can bind 
to SARS-CoV main proteinase well. 


Thus, the results above suggest that there is a link of 
SARS-CoV to human rhinovirus, Poliovirus, Arteritis 
virus, HAV, HCV, and Dengue virus. The inhibi- 
tors from these viruses could be modified to make them 
useful for SARS therapy. In fact, Cinatl et al.!° 
reported that human interferons, a medicine widely used 
in HCV patients, was useful for the treatment of 
SARS. Anyway, whether relationships between 
SARS-CoV and these pathogens exist or not, the present 
results provide a wide clue to search for anti-SARS 
inhibitors. 
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Figure 4. The interactions between SARS-CoV main protease (1Q2W, 
white cartoon) with different inhibitors, binding pockets are repre- 
sented by blue ball-stick. (A) FKI inhibitor from Hepatitis C Virus 
Ns3 Protease (yellow spacefill). (B) NFA inhibitor from Hepatitis A 
Virus 3c Protease (yellow spacefill). (C) Bowman-Birk inhibitor from 
Dengue Virus Ns3-Protease (yellow stick). 


Table 1. Binding pockets of SARS-CoV main proteinase for different 


inhibitors 
FKI inhibitor NFA Bowman-Birk inhibitor from 
from HCV inhibitor Dengue virus 
from HAV 
VAL 20 MET 17 GLN 19 PHE 140 
THR 25 VAL 18 VAL 20 LEU 141 
THR 26 GLN 19 THR 21 ASN 142 
LEU 27 VAL 20 CYS 22 GLY 143 
ASN 28 THR 21 GLY 23 SER 144 
PRO 39 THR 24 THR 24 CYS 145 
ARG 40 THR 25 THR 25 GLY 146 
HIS 41 THR 26 HR 26 SER 147 
VAL 42 LEU 27 LEU 27 TYR 161 
CYS 44 ASN 28 ASN 28 MET 162 
CYS 117 GLY 29 PRO 39 HIS 163 
TYR 118 HIS 41 ARG 40 HIS 164 
ASN 119 VAL 42 HIS 41 MET 165 
SER 139 GLN 69 VAL 42 GLU 166 
PHE 140 ALA 116 ILE 43 LEU 167 
LEU 141 CYS 117 CYS 44 PRO 168 
ASN 142 TYR 118 MET 49 THR 169 
GLY 143 ASN 119 LEU 50 GLY 170 
SER 144 GLY 120 ASN 51 VAL 171 
CYS 145 SER 121 PRO 52 HIS 172 
GLY 146 PRO 122 ASN 53 ALA 173 
SER 147 LEU 141 TYR 54 GLY 174 
TYR 161 ASN 142 LEU 57 PHE 181 
HIS 163 GLY 143 VAL 114 PRO 184 
HIS 164 SER 144 CYS 117 PHE 185 
MET 165 CYS 145 TYR 118 VAL 186 
GLU 166 GLY 146 ASN 119 ASP 187 
LEU 167 SER 147 TYR 126 ARG 188 
VAL 171 THR 135 GLN 189 
HIS 172 ILE 136 THR 190 
ALA 173 LYS 137 ALA 191 
PHE 181 GLY 138 GLN 192 
ASP 187 SER 139 ALA 193 
ALA 194 
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